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Abstract. We study initial cuts of models of weak two-sorted Bounded Arithmetics with 
respect to the strength of their theories and show that these theories are stronger than 
the original one. More explicitly we will see that poly logarithmic cuts of models of V° are 
models of VNC 1 by formalizing a proof of Nepomnjascij's Theorem in such cuts. This is 
a strengthening of a result by Paris and Wilkie. 

We can then exploit our result in Proof Complexity to observe that Frege proof systems 
can be sub exponentially simulated by bounded depth Frege proof systems. This result 
has recently been obtained by Filmus, Pitassi and Santhanam in a direct proof. As an 
interesting observation we also obtain an average case separation of Resolution from AC°- 
Frege by applying a recent result with Tzameret. 



1. Introduction 

This article is on the one hand on models of weak arithmetics and on the other on proof 
complexity, i.e. the question of how long formal proofs of tautologies have to be in given 
proof systems. Therefore the introduction will consist of two parts, one for each subject. 

Models of weak arithmetics, like /An, have been extensively studied for several reasons. 
They are possibly the simplest objects whose theories bear enough strength to do a good part 
of mathematics in, yet they are weak enough to allow for a certain kind of constructiveness. 
The latter has been demonstrated over and over again by various results connecting weak 
arithmetic theories with complexity classes and computability. We are interested in the 
strength of the theory obtained by restricting our objects of reasoning to a small initial 
part of a given model. Since a two-sorted theory, such as V , is much stronger on its 
number part than on its set part, it is likely that such a cut is a model of a supposedly 
much stronger theory. Indeed we will see in Section [3] that certain cuts of models of V° 
are models of the provably stronger theory VNC 1 . This strengthens a result by Paris and 
Wilkie |18||17j. who show that such cuts are models of VTC°. In fact they work in a 
more general setting and, following our argumentation, their result readily implies the sub 
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exponential simulation of TC -Frege by AC -Frege from Bonet, Domingo, Gavalda, Maciel, 
and Pitassi [3]. 

Proof Complexity, on the other hand, more or less began when Cook and Reckhow [8] 
discovered the close connection between the lengths of formal proofs in propositional proof 
systems and standard complexity classes. This connection yields a possibility of dealing 
with the coNP/NP question by asking, whether there exists a propositional proof system 
that is polynomially bounded. We will not directly address this question here, but rather 
explore the relative strengths of two major proof systems, Frege and bounded depth Frege. 
These proof systems have been extensively studied, due to their natural appearance as 
classical calculi, such as Gentzen's PK, and it is well known that Frege systems are stronger 
than bounded depth Frege systems, as the former system has polynomial size proofs for the 
Pigeonhole Principle (see [5]), while the latter does not (see [H] and [H]). Lately, Filmus, 
Pitassi and Santhanam [10] have proved a sub exponential simulation of Frege by bounded 
depth Frege using a combinatoric argument. In Section 0] we will obtain the same result 
by an application of our result about cuts to the provability of the Reflection Principle for 
Frege in bounded depth Frege. Currently Cook, Ghasemloo and Nguyen [6j are working on 
a purely syntactical proof that gives a slightly better result with respect to the strength of 
the simulated proof system. 

The paper is built-up as follows. In section [2] we briefly recapture some basics about 
Complexity Theory, Bounded Arithmetic, Proof Complexity and the various connections 
between them. As this is only expository it might be helpful to consult some of the references 
for a more detailed introduction (see pQ, [7] and [12] ). After that, in Section [3] we prove 
a formalization of Nepomnjascij's Theorem in the polylogarithmic cut of a model of V°. 
Using a standard algorithm for evaluating circuits and then applying the formalized version 
of Nepomnjascij's Theorem we can conclude that this cut is indeed a model of VNC 1 . 
Finally, in Section HI we apply this result to prove that a version of the Bounded Reflection 
Principle of Frege is provable in V . This, together with a standard argument linking 
the provability of Reflection Principles with simulation results, yields the sub exponential 
simulation of Frege by bounded depth Frege. 

2. Preliminaries 

We assume familiarity with Turing machines, circuits and standard complexity classes such 
as P, NP, TimeSpace(/, g), NO, AO and so on. See for example [1] for an introduction. 
We will not work a lot within these classes, but rather apply known relations between such 
classes and weak arithmetic theories. 

We will work in a two-sorted arithmetic setting, having one sort of variables representing 
numbers and the second sort representing bounded sets of numbers. We identify such 
bounded subsets with strings. See [7] for a thorough introduction. The underlying language, 
denoted C\, consists of the following relation, function and constant symbols: 

{+,-,<, 0,1, |-|,=i, = 2 , G} • 

An /^-structure M consists of a first-sort universe U^ 1 of numbers and a second-sort 
universe U^f of bounded subsets of numbers. If M is a model of the two-sorted theory V° 
(see I2.2p , then the functions + and • are the addition and multiplication on the universe of 
numbers. and 1 are interpreted as the appropriate elements zero and one with respect to 
addition and multiplication. The relation < is an ordering relation on the first-sort universe. 
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The function |-| maps an element of the set sort to its largest element plus one (i.e. to an 
element of the number sort). The relation =1 is interpreted as equality between numbers, 
=2 is interpreted as equality between bounded sets of numbers. The relation G holds for a 
number n and a set of numbers N if and only if n is an element of N. The standard model 
of two-sorted Peano Arithmetic will be denoted as N2. It consists of a first-sort universe 
U\ = N and a second-sort universe U2 of all finite subsets of N. The symbols are interpreted 
in the usual way. 

We denote the first-sort (number) variables by lower-case letters x, y, z, . . . , a, f3, 7, . . . , 
and the second-sort (set) variables by capital letters X,Y, Z, . . . , A, B,T, . . . . In case it helps 
to describe the meaning of a variable we will use lower case words for first-sort and words 
starting with a capital letter for second-sort variables. We can build formulas in the usual 
way, using two sorts of quantifiers, number quantifiers and string quantifiers. A number 
quantifier 3x (Vx) is bounded if it is of the form 3x(x < f A. . . ) (Vx(x </—>...)) for some 
number term /. A string quantifier 3X (VA) is bounded if it is of the form 3A(|A| < /A. . . ) 
(VA(|A| < / — > . . . )) for some number term /. A formula is bounded iff all its quantifiers 
are. All formulas in this paper will be bounded. A formula ip is in (or il^) if it uses no 
string quantifiers and all number quantifiers are bounded. A formula ip is a ^f + i (or II^_ 1 ) 
if it is of the form 3X\ < p(n) . . . 3X m < p(n)ip (or VXi < p(n) . . . VX m < p(n)ip), where 
tp G irf (or ip G E^, respectively). If a relation or predicate can be defined by both a Y,f 
and a 11^ formula, then we call it definable. The depth of a formula is the maximal 
number of alternations of its logical connectives and quantifiers. 

As mentioned before we will represent a bounded set of numbers A by a finite string 
<SjV = . . . S 1 ]^' 1 such that S l N = 1 if and only if i G N. We will abuse notation and 
identify bounded sets and strings, i.e. N and SV- 

Further, we will encode monotone propositional formulas inductively as binary trees 
in the standard way, giving a node the value 1 if it corresponds to a conjunction and the 
value 0, if it corresponds to a disjunction. Binary trees are encoded as strings as follows. If 
position x contains the value of a node n x , then the value of its left successor is contained 
in position 2x , while the value of its right successor is in 2x + 1. 

2.1. Elements of Proof Complexity. We restate some basic definitions introduced in 
0- 

Definition 2.1. A propositional proof system (pps) is a surjective polynomial-time function 
P : {0, 1}* — > TAUT, where TAUT is the set of propositional tautologies (in some natural 
encoding). A string ir with P(vr) = r is called a P -proof of r. 

We can define a quasi ordering on the class of all pps as follows. 

Definition 2.2. Let P, Q be propositional proof systems. 

• P simulates Q (in symbols P > Q), iff there is a polynomial p, such that for all r G TAUT 
there is a irp with P(irp) = r, such that for all ttq with Q(ttq) = r, |7rp| < p(\itq\). 

• If there is a polynomial time machine that takes Q-proofs and produces P-proofs for the 
same formula we say that P p-simulates Q (in symbols P > p Q). 

• If P and Q mutually (p-)simulate each other, we say that they are (p-) equivalent (in 
symbols P = Q and P = p Q, respectively). 
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In this article we will be mainly interested in bounded depth Frege systems and some of 
their extensions. A Frege system is a typical textbook proof system, such as Gentzen's 
propositional calculus PK. We will only sketch a single rule of such a system as an example 
and refer the interested reader to standard logic textbooks. 

r — >■ a, a r.A — ► a , 
a (Cut) 

Here, A and T are sets of formulas while A is a formula. T — > A is read as "The conjunction 
of all formulas in T implies the disjunction of all formulas in A". The Cut Rule therefore 
says that, if T implies A or A, and T and A imply A, then T already implies A. The formula 
A is called the Cut Formula. 

In a bounded depth Frege system the depths of all formulas in a derivation are bounded 
by some global constant. This is equivalent to being representable by an AC circuit. Thus 
we also call bounded depth Frege AC°-Frege. If the formulas are unbounded, we speak of 
NC 1 -Frege or simply of Frege. We readily get 

Fact 2.1. AC°-Frege < p Frege. 

A pps P is polynomially bounded iff there is a polynomial p such that every tautology 
r has a P-proof 7r with | vr | < p(|r|). 

We are interested in the existence of polynomially bounded pps. This is, at least in 
part, due to the following theorem. 

Fact 2.2 (|8j). NP = coNP There exists a polynomially bounded pps. 

An easier task than searching for a polynomially bounded pps might be to find some 
pps with sub exponential bounds to the lengths of proofs. This corresponds to the ques- 
tion, whether sub exponential time nondeterministic Turing machines can compute coNP- 
complete languages. To explore the existence of such systems we generalize Definition | 



Definition 2.3. Let P, Q be propositional proof systems and F a family of increasing 
functions on N. 

• P F-simulates Q (in symbols P > F Q), iff there is a function / € F, such that for 
all r G TAUT there is a irp with P(ttp) = r, such that for all ttq with Q(ttq) = r, 

Kp| < /(KqI). 

• If there is an Time(i ? )-machine that that takes Q-proofs and produces P-proofs for the 
same formula we say that P F-computably simulates Q (in symbols P > F Q). 

• If P and Q mutually .F-(computably) simulate each other, we say that they are F- 
(computably) equivalent (in symbols P = F Q and P = F Q, respectively). 

We say a pps P sub exponentially simulates a pps Q iff the above F can be chosen as a class 
of 2 n ° 1 functions. 

2.2. The theory V°. The base theory we will be working with is V°. It consists of the 
following axioms: 
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Basic 1. x + 1 7^ Basic 2. x + l = y + l— s>x = y 

Basic 3. x + = x Basic 4. x + (y + 1) = (x + y) + 1 

Basic 5. x • = Basic 6. x ■ (y + 1) = (x ■ y) + x 

Basic 7. (x<yAy<x)— >-x = y Basic 8. x < x + y 

Basic 9. < x Basic 10. x < y V y < x 
Basic 11. x<y-H-x<y + l Basic 12. x ^ — > 3y < x(y + 1 = x) 

LI. X(y)^y<\X\ L2. y + 1 = \X\ -> A(y) 

se. (|jsf| = |y | a v« < |x| (x(t) <-> -)■ x = y 

E fi -COMP. < yVz < y(X(z) ip(z)) , for all ip £ . 



Here, the Axioms Basic 1 through Basic 12 are the usual axioms used to define Peano 
Arithmetic without induction (PA - ), which settle the basic properties of Addition, Mul- 
tiplication, Ordering, and of the constants and 1. The Axiom LI says that the length 
of a string coding a finite set is an upper bound to the size of its elements. L2 says that 
\X\ gives the largest element of X plus 1. SE is the extensionality axiom for strings which 
states that two strings are equal if they code the same sets. Finally, S^-COMP is the 
comprehension axiom schema for S^-formulas (it is an axiom for each such formula) and 
implies the existence of all sets, which contain exactly the elements that fulfill any given 
T,q property. 

Fact 2.3. The theory V° proves the Induction Axiom schema for formulas <£: 

($(0) A Vx($(x) -> $(x + 1))) ->■ Vz $(z). 

When speaking about theories we will always assume that the theories are two-sorted 
theories as in [7]. 

The following is a basic notion: 

Definition 2.4 (Two-sorted definability). Let T be a theory over the language C D C\ 
and let $ be a set of formulas in the language C. A number function / is ^-definable in a 
theory T iff there is a formula (f(x, y, X) in $ such that T proves 

VxVX3!y( / j(x, y, X) 

and it holds that 

y = f(x,X) <-np(x,y,X). (2.1) 

A string function F is ^-definable in a theory T iff there is a formula f(x, X, Y) in $ such 
that T proves 

V24X3\Y<p(x, X, Y) 

and it holds that 

Y = F(x,X)^<p(x,X,Y). (2.2) 

Finally, a relation R(x, X) is ^-definable iff there is a formula (p(x, X) in such that it 
holds that 

R(x, X) o (p(x, X). (2.3) 
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Moreover we wish to talk about sequences coded by strings or numbers. For a string 
X we let X[i] be the ith bit of X. Assuming a tupling function (•,...,•) we can also talk 
of fc-ary relations for any constant k. We refer to X[(iq, . . . , ik-i)], to say that the objects 
io, ■ ■ ■ , ik-i are in the relation X (which is equivalent to saying that the predicate X holds 
for the number (io, . . . , ik-i), i-e. that the X contains that number as an element). For the 
sake of simplicity we also refer to X[(io, . . . , i^)] by X[Iq, . . . , i^]. 

Using fc-ary relations we can also encode sequences of bounded numbers xq, . . . , x m by 
Xi = X[(i, 0)]X[(i, 1)] . . . X[(i, k)] in binary. Matrices and so on can obviously be formalized 
in the same way. 

Given a string X[(x±, . . . ,Xk)] representing a /c-ary relation, we denote the k — l-axy 
substring with parameters a^, . . . , by X [(-,...,-, a^, •)]. For example we 

refer to the element Ojj of a given matrix A[{xi, X2, x^)] as A[(i,j, ■}], a string representing 

a, ij in binary. Observe that this substring can be defined in V°. 

Given a number x we denote by (x)j the jth number in the sequence encoded by x. To 
do this we assume a fixed definable encoding of numbers that is injective. The sequence 
itself will be addressed as (x). As above, we can also talk about matrices, etc. in this way, 
i.e. read such a sequence as a sequence of A;-tuples. 

We want to identify strings of short length with sequences of numbers. Thus, given 
a string X of length 0(n) we can S^-define (in V°) a number x < 2°^ that codes a 
sequence (x), such that X[i] = (x)i for all % < \X\ and vice versa. We will use (x) pa X and 
(x)i « X[i] to denote the above identification. Observe that n has to be very small in order 
to be able to do the above in V°. 

2.2.1. Computations in models of V°. Given a polynomially bounded Turing machine A in 
a binary encoding, we can T,f define a predicate /KCCa{X), that states that X is accepted 
by A. This can readily be observed, since, provided some machine A, there is a constant 
number of states u\ , . . . , an d the whole computation can be written into a matrix W 
of polynomial size. That W is indeed a correct computation can then be easily checked, 
because the computations are only local. 

More precisely let A = (a±, . . . , o^; S) be given, where the Uj are different states, with u\ 
being the initial state and at being the accepting state and S is the transition function with 
domain {<7i, . . . , at} x {0, 1} and range {<7i, . . . , Ufc} x {0, 1} x {<— , 4-, — >}, which describes 
what the machine does. I.e. if S(a, b) = (c, d, e), then if the machine is in state a and reads 

b, it replaces b by d, goes into state c and moves one position on the tape in the direction e. 
For our formalization we will assume a function 6 : N 2 — >• N and interpret it in the following 
way, 5(<r a ,b) = ({8(<r a , (S(a a ,b)) 2 , {5(a a ,b)) 3 ), where we identify |= Q,<-= !,->•= 2. 
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Let the polynomial p bound the running time of A, then we can formalize ACCa(^0 as 
follows 

3W < {p(\X\f •log(A;) 2 )V^,i , < p(|X|)V0 < a < k( 

i<\X\^ ((W[(0,i, = X[i] A i > -4 (W[<0,i, -)]> 2 = A <W[(0,0, •}]} 2 = 1)A 

i>\X\^ ((W[(0,i, = A (J7[<0,* s -}]} 2 = 0)A 

(W[{j,i, .)]> 2 = -)• + 1,*, •)]>! = (W[0>\ ->]>i)A 

(W[0>-,-}]}2=a^((WKi + l,i,-}])l = {%,(^[{j,v)]}i))2A 
«*(a, -)])i)} 3 = -)■ (^[(j + -)]) 2 = (5(a, (^i, ■>]> 1 )>i)A 

((<f(a, ■)]) 1 )) 3 = 1 -)• + l,i - 1, .)]>a = (*(a, -}]}i))i)A 

«*(a, (^[<j,i, -)])i)} 3 = 2 + l,i + 1, ->]} 2 = (S(a, (W[(j,i, •>]>i)>i))A 

(i ^ i' -> i, ->]>2 > -)]) 2 = 0)) A WM\X\),i, •}]} 2 = fc). 

(2.4) 

Thus, in plain English, ACCa(X) says that there exists a matrix of pairs of numbers 
that witnesses an accepting computation of A. Here, (W[{i,j, ■)}) is supposed to code the 
jth cell on the Turing machine's tape after i steps of computations on input X. As noted 
above, (W[(i,j, is a binary number, which is the value of the cell, (W[(i,j, -)]) 2 is a 
number coding the state the machine is in iff the pointer is on that cell. 

The second and third line of the definition say that the tape in the initial step contains 
X padded with zeroes in the end to get the proper length (p(|X|)) and that the read/write 
head is in its starting state and position. The fourth line says that if the read/write head 
is not on cell i, then nothing happens to the content of cell i. The fifth line says that the 
content of the cell, where the read/write head is in step j, is changed according to 5. The 
next three lines tell us where the read/write head moves. The last line says that there is at 
most one position on the tape where the read/write head may be at any step and that the 
state after the last step is accepting. 

We also define a Sf -predicate REACH A (T, Y') that says that A reaches configuration Y' 
from configuration Y in at most many steps. This is essentially the same predicate as 

ACC, with the constraints on the initial and accepting state lifted and instead a constraint 
added that the first line of computation is Y and the last is Y 1 . We omit the details as it 
does not severely differ from the above definition of ACC. 

2.3. Extensions of V°. The Theory V° serves as our base theory to describe complexity 
classes by arithmetical means. 

The problem, whether a given monotone formula ip of size i and depth [log(^)] is 
satisfiable under a given assignment I is AC°-complete for NC 1 . Therefore Cook and 
Nguyen ([7]) define the class VNC 1 as V° augmented by the axiom MFV = 3Y < 
2a + 1.5MFv( a , G, I, Y), where 

5 M Fv(a, G, I, Y) = Vx < a{{Y{x + a) <-> I(x)) A K(0)A 
< x -> (Y(x) O ((G(x) A Y(2x) A Y(2x + 1))V 

(-■G(x) A {Y(2x) V Y(2x + 1)))))). 

So, MFV states that there is an evaluation Y of the monotone formula represented by 
G under the assignment given by I of length at most 2a + 1. More specifically, G is a 
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tree-encoding of the formula, where G{x) is true, if node x is A and false, if x is V. The 
evaluation Y takes the value of the variables given by I and then evaluates the formula in 
a bottom-up fashion using a standard tree encoding. Thus, the value of the formula can be 
read at Y(i). 

It is interesting to observe that MFV does not hold in V°. This is, since an application 
of the Witnessing Theorem for V° to a proof of MFV would yield an AC°-definition of 
satisfaction for monotone NC 1 circuits. This implies that monotone NC 1 C AC , which is 
known to be false. 

2.4. Relation between Arithmetic Theories and Proof Systems. In this section we 
will remind the reader of a connection between the Theory V° and some of its extensions 
and certain propositional proof systems (see also [7][12j). 

Definition 2.5. The following predicates will be subsequently used. They are definable 
with respect to V° (see [12]). 

• Fla(X) is a Yjq formula that says that the the string X codes a formula. 

• Z \= X is the definable property that the truth assignment Z satisfies the formula 
X. 

• Taut(X) is the IT^ formula Fla(X) A \/Z < t(\X\)Z \= X, where t is an upper bound to 
the number of variables in formulas coded by strings of length \X\. 

• PrfF d (II, A) is a T,q definable predicate meaning II is a depth d Frege proof for A. 

• Prf F (IM) is a Tiq definable predicate meaning II is a Frege proof for A. 

The following holds 

Fact 2.4 (see [7]). The Theory V° proves that AC°-Frege is sound, i.e. for every d 

VAVnPrf Fd (n,yl) -> Taut(A). 

Fact 2.5 (see The Theory VNC 1 proves that Frege is sound, i.e. 

VvlVnPrf F (n,yl) -> Taut(A). 

On the other hand, provability of the universal closure of T,q formulas in V° and 
VNC 1 implies the existence of polynomial size proofs of their propositional translations in 
AC°-Frege and Frege, respectively. 

The propositional translation \(p(x, X)}fh,n of a T,q formula ip(x,X) is a family of 
propositional formulas built up inductively (on the logical depth) as follows. If (p is atomic 
and does not contain second sort variables, we evaluate <p in N2, if it contains second sort 
variables, we have to introduce propositional variables. If (p is a boolean combination of 
formulas tpi of lower depth, the translation is simply the same boolean combination of the 
translations of the ifti. If <p is z\ip or we translate it to the disjunction or conjunction of 
the translations, respectively. For a proper definition see [7]. 

Fact 2.6. There exists a polynomial p such that for all T,q formulas <£>(x, X) the following 
holds 

• If V° h VXVx<^(x, X), then there exist bounded depth Frege proofs of all [</?lm,n of length 
at most p(max(m, n)), for any rh,n. 

• If VNC 1 h VXyxip(x, X), then there exist Frege proofs of all [y]m,n of length at most 
p(max(m, n)), for any fh,n. 
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These proofs are effective in the sense that for any such tp there exists a polynomial-time 
computable function F v that maps any tuple (m, n) to the above proofs of [yj^n- 

Facts |2~31 and 12.51 are examples of general principles, the so called Reflection Principles, 
which are defined as follows. 

Definition 2.6 (Reflection Principle). Let P be a pps. Then the Reflection Principle for 
P, Refp, is the VAf -formula (w.r.t. V°) 

VnVXVZ((Fla(X) A Prfp(II,X)) ->■ (Z N X)), 

where Prfp is a A;f -predicate formalizing P-proofs. 

Reflection Principles condense the strength of propositional proof systems. In what 
follows we will summarize some such results for the proof systems and theories used here. 
A detailed exposition can be found in [7J, chapter X, or in [12], chapter 9.3. 

Theorem 2.1. If V° h Refp then bounded depth Frege p-simulates Frege. 

We will only give a brief sketch of the proof here and leave out the technical details. 

Sketch. Let tp be a formula and tTw a Frege proof of ip which is witnessed by a Turing machine 
Tp (cf Def I2.ip . Since V° proves Refp, by Facts [27H and [2761 we have polynomial size proofs 
of its translations [Refp] in bounded depth Frege. Bounded depth Frege itself, however, is 
strong enough to verify that a proper encoding of the computation of Tp on input (tv v ,<p) 
is correct. Thus it can verify that tt^ is a Frege-proof and, using the translation of the 
Reflection Principle and the Cut rule, conclude [Taut(y)J. From this tp follows, cf. [12] 
Lemma 9.3.7. □ 

Given a term t and a variable x, we can also introduce the abounded version of the 
Reflection Principle for some given pps P, Refp(t(x)) that claims soundness only for t- 
bounded proofs. 

Definition 2.7 (Bounded Reflection). Let t be first-sort variable and P a 

pps. Then the Bounded Reflection Principle Refp(t(x)) is the formula 

Vn < t(x)VX < t(x)VZ < t(x)((F\a(X) A Prf P (IL,X)) -> (Z t= X)). 

We can now generalize Theorem 12.11 in the following way. 

Theorem 2.2. Let t be a /^-term and x a number variable. If t{x) < x for x large enough 
and if V° h VxRefp(t(x)) then for every propositional formula tp with a Frege proof of 
length t(x) there is a bounded depth Frege proof of tp of length x°^\ This proof can be 
efficiently constructed. 

Proof. The proof is the same as that of Theorem l2.1i Using the Bounded Reflection Principle 
we can encode Frege proofs of length t(x) as bounded depth Frege proofs of length x ^. □ 

As a corollary we get 

Corollary 2.3. If V° h Refp(|x[ fc ) for all k € N, then bounded depth Frege sub exponen- 
tially simulates Frege: For all D > 1, S > exists d> D, such that the existence of a Frege 
proof of length m of a depth D formula implies the existence of a depth d Frege proof of 
length at most 2 m . 
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3. POLYLOGARITHMIC CUTS OF MODELS OF V° ARE MODELS OF VNC 1 . 

We will first introduce the notion of a cut X of a given two-sorted arithmetic model Ai. 
This model theoretic approach provides a very good insight on what actually happens 
semantically with the small elements of arithmetical models. 

Definition 3.1 (Cut). Let T be a two-sorted arithmetic theory and 

N = {Nl ,N 2 , +* <* o* 1", \-\ N , =f , G N } 

a model of T. A cut 

M={Mi,M 2 ,+ , ■ ,< , U ,1 , |-| ,=i ,=2 >€ j 

of N is any substructure such that 

• Mi C iV 1; M 2 C N 2 , 
. o M = N , 1 M = 1 N , 

• M\ is closed under +-^, - N and downwards with respect to < , 

• M 2 = {X G N 2 | X C Mi}, and 

• o M is the restriction of to Mi and M 2 for all relation and function symbols o G C\. 
We call this cut the Poly logarithmic Cut iff 

x e Mi •£> 3a G iVi,fc G N x < \a\ k . 

To examine the strength of the theory of such cuts of models of V° , we will show that a 
formal connection between efficient computability and S^-definability holds. This stands in 
contrast to general bounded subsets, where the connection is presumably only with respect 
to E^-definability via the predicate ACC (see (|2,4p on page[7|). The intended theorem is 
a formalization of Nepomnjascij's Theorem [16] (see also [12] pg.20). We will sketch the 
original proof before starting the formalization. 

Theorem 3.1 (Nepomnjascij [E]). Let c G N and < e < 1 be constants. Then if the 
language L G TimeSpace(n c , n e ), the relation x G L is definable by a S^-formula over N. 

Proof. We will prove the theorem by induction on k for L G TimeSpace(n fc '( 1 ~ e ^, n e ). 

Let k = 1 and L G TimeSpace(n fe '( 1_e ), n e ). For any x < 2 n the whole computation 
can be coded by a number y of size 2°^ n \ The existence of such a computation gives the 
desired S^-definability. 

For k > 1 we write a sequence yo, y±, . . . , y n i-e of intermediate results coding the com- 
putation, where yo codes the starting configuration on input x, such that we can verify 
that yi+i is computable from in TimeSpace^^ 1 )^ 1-1 ^, n e ). By assumption there ex- 
ists a E^-formula reach fc_1 such that reach fc_1 (y.j, j/j+i) holds iff yi+\ is computed from y^. 
Additionally, the whole sequence has length 0{n) and so we can write the sequence of in- 
termediate results yi as a number y of length 0(n). Now, the S^-definition of x G L is 
simply 

3y < 2°WVi < n 1 - e reach fc - 1 ((y) J , (y) l+1 ) 

A (y)o encodes the starting configuration of A on input x 

A (y) n i-e is in an accepting state. □ 

We will now formalize this result in V° as follows 
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Theorem 3.2. Let N \= V°. Let m = \a\ for some a G Aq and let c, k G N and e < \. If 
L G TimeSpace(m c , m e ) (for strings of length m) is computed by Turing machine A, then 
there exists a definition in N of the S^-predicate ACC^ on the interval [0, m k \. I.e. any 
Y G L, bounded by m fc is S^-definable in N and therefore exists in the polylogarithmic cut 
of N. 

The following version of the proof stems from a discussion with Stephen Cook and Neil 
Thapen during the SAS programme in Cambridge. It is more explicit than the original one 
and clarifies the argument. 

Proof. We will inductively on d define a Y>q relation reach^(J, p\, pi, cell, comp) that states 
that the P2th cell of the work tape of A, starting on configuration / and computing for 
pi ■ m k steps via the computation comp is cell. We will bound the quantifiers in such 
a way that we can conclude that both variables can be of the number sort. As d depends 
only on A and k we will be doing this induction outside of the theory to construct d many 
formulas. We will then prove the above mentioned properties of reach^ by T,q induction on 
Pi- 

Keep in mind that a cell is given as a pair (bit, state), where bit is the actual value of 
the cell and state is a number > coding the state the Turing machine is in iff the pointer 
is on that cell and otherwise. As before the transition function is denoted by 5. We let 
I be a string coding the input at the start of the computation. That is, I is a sequence of 
length len(I) < m k , such that I[l, 1] = 1, I[j, 1] = for all j > 1 and I[j, 0] is the jth input 
bit. We let (comp) be a sequence encoding the computation of A, such that (comp)^/ ^ is 
the state, the machine is in after j steps (0 denotes that the read/write head is not on cell 
j', while a greater number gives the state and witnesses that the read/write head is on cell 
j'). (comp)/jj/ i0 ) is the value of cell f after j steps of the computation. Observe that this 
also implies that the computation can be encoded as a number, that is, it has to be very 
short. This is straight forward from the quantifier bounds. 

We can now define 
reach \ A (I,p\,p2, cell, comp) 

= (V/ < \len(I) £ ], (comp) (l!j , )0) w l[j',0] A (comp)^,^ w I[j' , 1])A 
[Vj < llenil) 1 ^],? < \len(I)%a< \A\ 

(( com P)(i,iM> = -> « com PW,o> = ( com P)<i+i,j',o>))A 
((comp) (j)/jl) =<*->( 
((5{a, (comp) (jii , i0) )} 3 = -> ((comp) (j+lii , 5l) = (5(a, (comp)^-,^)}^ 

( com P)(i+ij',o) = ( 5 (« ) ( com P)(i,j',o>))2))A 
((5{a, (comp) - /i0) )} 3 = 1 -> (comp) (j+lj .,_ ljl) = (5{a, (comp)^-,^)}^ 

(comp} 0+lj -, )0) = ((5(a,(comp) (j - J -, i0 ))>2))A 
((S(a, (comp} (j)/j0) )) 3 = 2 -»• (comp} (j+1)/+1;1) = (<5(a, (comp) (jJ , )0) ))iA 

(comp} 0+lj -, )0) = ((5(a,(comp) (j - J -i i0 ))>2))))A 
V^ / < [Zen(7) e l((comp} 0A1) > -> (comp)^^) = 0)]A 
((cell)i = (comp) (piiP2i0) ) A «cell) 2 = (comp} (pi>P2jl) ). 
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It is straightforward to prove by induction on the number of lines in comp that comp is 
uniquely defined by reach^. We let 

Reach^(7,pi,p2,cell) = def 3comp < q(\I\) reach° A (I,p 1 ,p 2 , cell, comp), 

where q is some polynomial depending on the encoding. Here it is vital that q can be defined 
such that is a number in N. This is possible due to the quantifier bounds we used 

when defining reach^. Thus, Reach^ is defined by a T,q formula. 
Informally Reach^ formalizes that there is a computation 

(comp) {1>1) .) (comp)( 1>2) .) ••■ (comp) (li(mfc)v> 

(comp)^!,.) (comp) (2i2 ,) ••■ (comp) (2i(mfc)v> 



(comp) i-fce (comp) i_ fce • • • (comp) i- fce 

that is correct in the sense that we can verify that we get from line to line via the transition 
function of A and gives the appropriate values of the cell at (pi,P2)- Observe that the size 
of the whole computation as presented above is linear in m, i.e. that it can be coded as a 
number in N. 

We will now proceed by inductively defining reach^ and Reach^. Assume that reach^ - 1 
has already been defined by a Y,q formula over N. We then let 

reach^(7,pi,p 2 , cell, comp) 

= (V/ < \len{I) e ], (comp) (1J , i0 ) » l\j',0] A (comp) (1/il) » /[/, 1])A 

(Vj < [/en(J) ^Bcomp' < q(len(I))Vj' < |7en(J) e l3cell' < |A| V/ < |A| 

«co m P)(j+ij'j"> ( cell ')i")A 

reach^ - 1 ((comp)^ ) . m — ~, /, cell', comp'))A 

((cell)i (comp) (piiP2j0) ) A «cell> 2 = (comp) (piiP2jl) ). 

Again, we can prove uniqueness of the computation by induction on the number of its lines 
and let 

Reach^(i,pi,p 2 ,cell) = def 3comp < q(\I\) reach A (i,p 1 ,p 2 , cell, comp). 

That this is a Hq definition follows by induction and the same argument as for Reach^. Here, 
the predicate Reach^ _1 (i,p;L, j> 2 , cell) takes the role of the transition function in witnessing 
that each line follows from the preceding one. The total size again is linear in m. 

We now can give a definition of the predicate W[(i,j, •)] coding the computation as 
in ACQ4 on input X of length m k . We let W[(i,j, ■)] = cell = 

3r ,...,r d < \X\~ ,coni,... ,con d < p(\X\ t )\fz 1 , . . . , z d < \X\ e Bcelli, . . . , cell d < \A\ 

d ^ 

(i = Yjre ■ \X\ e ^ A Reach^X^d^d.celld) A {con d ) Zd = ce\\ d 

A Reach^ _1 (con d ,r d _i,2; d _i,cell d ) A (con ci _i) 2;d _ 1 = cell d _i 

A Reach^(coni,ro, j, cell)), 
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where p is a polynomial depending on the encoding and X is the starting configuration of 
A on input X. 

Informally the above formula says that we compute the configurations of A by using 
the predicates Reach^ through Reach^. That is, after the application of Reach^ (i.e. after 
making the biggest steps) we have reached configuration corid, which we plug into Reach^ -1 
to get configuration cora^-i and so on. It remains to show that this definition of W[(i,j, •)] 
coincides with the real one, i.e. that W[{i + 1, •, •)] follows from an application of the 
transition function of A from W[(i, •, •)]. 

We will prove this inductively, depending on %. Again let rg be such that % = Y2i r e ' 

\X\ k . If i < \X\ k the assumption follows straightforwardly from the definition of 
Reach^. Now for bigger i. If the ro, given as above, is bigger then then again the 
assumption follows from the definition of reach^. Now let £ > be the first index with 
r£ > 0. We the have to argue that reach^ -1 has the desired property. This, however, 
follows straightforward if we can verify this assertion for reach^ -1 . Observe that d is a 
constant depending only on A and k, so we need to make this argument only a constant 
number of steps to reach reach^, where we know that the assertion holds. 

Since we can code the whole computation as a X^-formula (in N) , we can easily deduce 
a S^-definition of the related set by simply stating that the computation accepts (i.e. that 
in the last line of the computation the state is accepting). This concludes the proof. Q 

We can now prove our main result. 

Theorem 3.3. Let JVNV and M C N be the polylogarithmic cut. Then M N VNC 1 . 

Proof. We have to prove that for all strings G v € M2, representing a formula (p as a tree 
and assignments / € M2 to its variables (i.e. leafs in the tree representation) a string Y 
exists in M2 that contains all values of (p's subformulas as in the definition of MFV in 
Section [231 and satisfies the inductive conditions of MFV. 

However, by S^-comprehension and the formalized Nepomnjascij's Theorem it suffices 
to describe an algorithm that computes, for given and I, whether i belongs to Y in 
TimeSpacedGy,^ , |G^| e ) for some k G N and e < 1. 

The following is a recursive algorithm computing the value of Y[i], given G '. — G^, I 
and i. 

NodeValue(G,I,i) 

• boolean left; boolean right; 

• If ±>2- 1 G I 

- Output (0) ; End; 

• Else If i>|G| 

- Output (I[i-|G|]); End; 

• Else If G[i]=l 

- left := NodeVal(G,I,2i) ; 

- right := NodeVal(G,I,2i+l) ; 

- Output (left AND right); End; 

• Else If G[i]=0 

- left := NodeVal(G,I,2i) ; 

- right := NodeVal(G,I,2i+l) ; 

- Output (left OR right); End; 

• Else 

- Output (0) ; End; 
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Observe that the algorithm at any given point only stores a constant amount of data per level 
of the tree G and therefore uses only 0(log(|G|)) space. The number steps the algorithm 
makes is clearly polynomial in the size of G. Therefore by Theorem l3.2l for every monotone 
formula 99, representable as a tree in M, we get a formula eval^, such that eval ¥ ,(i,/) = 
Y[i]. Observe that eval^ depends on the size of ip and on its logical depth, as the first is 
essentially the size of the input for the machine, that eval^ codes, while the latter determines 
the longest iterations in the recursive algorithm. Applying the Comprehension Schema in 
V°, i.e. in N, this verifies the existence of a Y as in MFV for all formulas represented by 
trees in M.Therefore MFV holds in M and so M N VNC 1 . □ 



4. Implications for Proof Complexity 

We now wish to apply the above results to propositional proof systems. More precisely 
we wish to show that theories of small cuts of a model of a given theory T correspond to 
stronger proof systems than T does. An elegant way of showing such a statement is via the 
Reflection Principles of the given proof systems, i.e. the statement that the proof system is 
correct, as explained in Section 12.41 With their help we can conclude the following recent 
result of Filmus, Pitassi and Santhanam |10j . 

Theorem 4.1 ([10]). Every Frege system is sub exponentially simulated by AC°-Frege 
systems. 

Proof. By Theorem 12.21 we have to prove the polylogarithmically bounded Reflection Prin- 
ciple for Frege in V°. This, by Theorem 13.31 however . corresponds to proving the Reflection 
Principle for Frege in VNC 1 , which holds by Fact 12.51 It also follows from Theorem 12.21 
that this proof can be efficiently computed from the Frege proof. □ 

Another, related, application is in the separation of propositional proof systems. In |15j 
we proved the following. 

Proposition 4.2. For almost every random 3CNF A with n variables and m = c ■ n 1,4 
clauses, where c is a large constant, ->A has polynomially bounded TC°-Frege proofs. 

On the other hand it is well known (see for example [9]) that such formulas have 
no subexponential refutation in Resolution. Thus, this yields an average case separation 
between Resolution and TC°-Frege. We can now extend this result to an average case 
separation between Resolution and AC°-Frege as follows. 

Theorem 4.3. For almost every random 3CNF A with n variables and m = c-n 1 ^ clauses, 
where c is a large constant, —>A has subexponentially bounded AC°-Frege proofs. 

Proof. By Theorem 13.31 the polylogarithmic Cut of any V°-model is a model of VNC 1 , 
therefore also of VTC°. This yields, as in our proof of Theorem 14. 1} that AC°-Frege 
subexponentially simulates TC°-Frege. The result now follows from Proposition 14.21 □ 
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5. Conclusion and Discussion 

As we have seen cuts of models of weak arithmetics constitute an appropriate way for 
reasoning about super-polynomial simulations between proof systems. An advantage in 
comparison to syntactic arguments is the possible applicability of results in Model Theory 
and a more uniform treatment. This can readily be observed as with our argument, e.g. 
the work of Paris and Wilkie |17j |18j immediately imply the simulation results from Bonet 
et al. [3]. 

This leads to interesting possibilities for further research, especially towards the weak 
automatizability of weak propositional proof systems such as Resolution. The underlying 
theory, which was V° in our argument, must be significantly weakened, however. If we could 
take Tl(a) as our base theory, we could reason about whether f?es(log) has the feasible 
interpolation property in the same way as Krajicek and Pudlak |13j . Bonet, Pitassi and Raz 
[3] or Bonet, Domingo, Gavalda, Maciel, and Pitassi [3J. Now, if Res(log) does not have 
quasi-polynomial feasible interpolation we know by a result from Atserias and Bonet [2] that 
Resolution is not weakly automatizable, so we would be finished. Whether we can actually 
do it depends on the strength of the theory the polylogarithmic cut of Tf(a) models and if 
we can formalize some sort of iterated multiplication (such as in [11]) in that theory. Also, 
the security of Diffie-Hellman seems to be a more appropriate assumption than that of RSA, 
as the computational power needed to verify the correctness of Diffie-Hellman seems to be 
lower. 
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